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(57) Abstract: The invention relates to methods for analyzing polysaccharides. In particular, compositional and sequence infor- 
mation about the polysaccharides are derived. Some methods use NMR in conjunction with another experimental method, such as, 
capillary electrophoretic techniques for the analysis. 
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NOVEL METHOD FOR SEQUENCE DETERMINATION USING NMR 

FIELD OF THE INVENTION 
The invention relates to methods for analyzing compositional and sequence 
5 information of polysaccharides. 



BACKGROUND OF THE INVENTION 
Heparin and heparan sulfate glycosaminoglycans are complex acidic 
polysaccharides that are involved in a variety of physiological and pathological 

10 conditions. Advances in different areas of biology have elucidated the potential roles of 
HSGAGs in key biological processes (Casu and Lindahl, 2001; Lindahl, 2000; 
Sasisekharan and Venkataraman, 2000; Shriver et aly 2002) including thrombosis 
(Petitou et al, 1999), angiogenesis (Sasisekharan et al, 1997), viral invasion (Chen et 
aL 1997; Fry et al, 1999; Shukla et al, 1999) and tumor growth (Hulett et al, 1999; 

15 Vlodavsky et al, 1999; Liu et al 2002). The repeat unit of a HSGAG polymer is a 
disaccharide comprising a uronic acid (U), which can exist in two different epimeric 
forms - a-L-iduronic (I) or p-D-glucxironic (G), linked 1-^4 to a a-D-glucosamine 
residue (A). There are variations within the disaccharide imit in the form of sulfation at 
the 2-0 position of the uronic acid, 3-0 and 6-0 position of the glucosamine and 

20 sulfation or acetylation of the N-position of the glucosamine (Casu and Lindahl, 2001). 

Perhaps the best studied structure-activity relationships in HSGAGs is a 
pentasaccharide sequence in heparin that specij5cally binds to and activates 
antithrombin-in thereby playing an inhibitory role in the blood coagulation cascade 
(Bourin and Lindahl, 1993). Heparin and its derivatives, low molecular weight heparins 

25 (LMWHs), are the most widely used clinical agents for prevention of deep vein 
thrombosis after surgery (Breddin, 2000) and for prevention of myocardial infarction 
after coronary invasion procedures (Cohen, 1999). Based on the anticoagulant properties 
of heparin, new therapeutic ^pUcations of heparin are being envisaged (Rosenberg, 
2001). A synthetic version of the pentasaccharide has been used as an antithrombotic 

30 drug (Turpie et al, 2001). 

In order to understand the structure-activity relationship of HSGAGs, several 
analytical tools have been developed for sequencing oligosaccharides which include gel 
electrophoresis (Tumbull et al, 1999), HPLC (Vives et al, 1999), matrix assisted laser 
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desorption/ionization mass spectrometry (MALDI-MS) (Venkataraman et al, 1999) and 
nanoelectrospray mass spectrometry (Pope et al, 2001). These analytical fools have 
been applied to dissect the HSGAG oligosaccharide into smaller fragments using a 
battery of depolymerizing enzymes and other chemical methods and determining the 
5 sequence of the oUgosaccharides based on specific properties of the smaller fragments 
(Kreugerefa/. 2001). 

SUMMARY OF THE INVENTION 
The invention relates, in part, to an analytical tool for analyzing oligosaccharides, 

10 such as HSGAGs. Determining the sequence and/or composition of an ohgosaccharide 
is helpful for elucidating the structure-function relationship of oligosaccharides in key 
biological processes. 

In some aspects, a method of determining the composition of an ohgosaccharide 
is provided. The method involves obtaining a measurement of a first property of the 

15 ohgosaccharide using NMR spectroscopy, and obtaiimg a measurement of a second 
property of the oligosaccharide by a second experimental method, wherein the first and 
second properties determine the coniposition. In one embodiment, the second property 
of the ohgosaccharide is measured by capillary electrophoresis. 

A method of analyzing an ohgosaccharide is provided according to other aspects. 

20 The method involves obtaining a measurement of a first type of disaccharide linkage of 
the ohgosaccharide by a first experimental method, and obtaining a measurement of a 
second type of disaccharide linkage of the ohgosaccharide by a second experimental 
method, to analyze the oligosaccharide. In one embodiment the first type of disaccharide 
linkage is measured by NMR spectroscopy. In another embodiment the second type of 

25 disaccharide linkage is measured by capillary electrophoresis. 

Accordmg to another aspect, a method of analyzing an ohgosaccharide, by 
identifying a first property of the ohgosaccharide by NMR spectroscopy, and identifying 
a second property of the ohgosaccharide by capillary electrophoresis, to analyze the 
ohgosaccharide is provided. 

30 The methods, in some embodiments, involve determining possible sequences of 

the ohgosaccharide that are consistent with the measurement from the NMR 
spectroscopy and second experimental method. 
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In other embodiments the methods involve constructing a list of possible 
sequences based on the measurement from the NMR spectroscopy, and eliminating 
sequences from the hst of possible sequences that are not consistent with the 
measurement of the second experimental method. 

5 The second experimental method may be used to distinguish the reducing and 

non-reducing ends of the oHgosaccharide or fragments thereof. In one embodiment, the 
second experimental method includes chemical degradation. In another embodiment the 
second experimental method includes end-labeling. 

The second experimental method may also be used to determine the signature of 

10 the reducing end of the oligosaccharide or fragments thereof. In one embodiment the 
signature of the reducing end is determined with capillary electrophoresis. The second 
experimental method may allow the determination of the sulfation pattern of the second 
type of disaccharide Unkage. 

The NMR spectroscopy, in some embodiments, includes the determination of the 

15 sulfation pattern of the oligosaccharide or fragments thereof In another embodiment the 
NMR is performed to identify and quantify both reducing and non-reducing ends. 

The methods may optionally involve obtaining a measure of an additional 
property of the oligosaccharide by a third experimental method to further eliminate 
sequences not consistent with measurements obtained from the third experimental 

20 method. 

In some embodiments the NMR spectroscopy is ID proton or 2D 
COSY/TOCSY. In other embodiments, the NMR spectroscopy is HSQC, DQF-COSY, 
NOESY or ROESY. The NMR spectroscopy may also be any combination of the above. 

The NMR spectroscopy may be performed on tiie oligosaccharide in its intact 
25 form. Altematively, it may be performed on the oligosaccharide in a fragmented form. 
The fragmented form may be produced by enzymatic digestion. Enzymatic digestions 
may be complete or, in the alternative, partial. 

The second experimental method may also involve digesting the oligosaccharide 
to a fragmented form, e.g., optionally produced by enzymatic digestion. In one 
30 embodiment the enzymatic digestion is complete. 

In another aspect, a method of generating a Kst of possible sequences of an 
oligosaccharide is provided. The method involves defining a set of properties of the 
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oligosaccharide by performing NMR spectroscopy and a second experimental method, 
wherein the NMR spectroscopy provides a measurement of a first type of disaccharide 
UnJcage and the second experimental method provides a measurement of a second type of 
disaccharide linkage, and constructing a hst of possible sequences based on the set of 
5 properties of the oUgosaccharide. In one embodiment the NMR spectroscopy includes a 
measure of the monosaccharide composition of the oligosaccharide. 

The method may involve a data structure which represents the properties as non- 
character values. Li one embodiment the data structure includes a value for each type of 
monosaccharide. In another embodiment the data structure encodes a value for each type 
10 of disaccharide hnkage. In another embodiment the values are bmary. 

A Ust of possible sequences of an oligosaccharide produced jfrom the methods is 
also provided. 

In the above aspects and embodhnraits the labels of "first" and "second" 
experimental methods are not intended to denote the order in which the experiments need 

15 be performed. In some embodiments, the NMR spectroscopy may be performed before 
the other experimental methods, while in other embodiments, the NMR spectroscopy 
may be performed after the other method. In still other embodiments the two or more 
experimental methods maybe performed concurrently. 

Each of the embodiments of the invention can encompass various recitations 

20 made herein. It is, therefore, anticipated that each of the recitations of the invention 
involving any one element or combinations of elements can, optionally, be included in 
each aspect of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 is a representation of the (A) disaccharide building block of heparin and 

heparan sulfate polysaccharides (left) with positions of sulfation marked as X (Y position 
can be sulfated or acetylated). The hexadecimal code (middle) to represent the 
disaccharide repeat unit in the PEN firamework was described in our previous study 
(Venkataraman et aL, 1999). The decomposition of the disaccharide hexadecimal code to 
30 a base4 code for the uronic acid and baseg (octal) code for the glucosamine is shown on 
in (B) The base4 codes for the (i) uronic acids, (ii) octal codes for glucosamines, (iii) 
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signed hexadecimal codes for A-U linkages and (iv) AU-A linkages observed in the 
oligosaccharides used in this study. 

Figure 2 shows the (A) 500 MHz spectrum of synthetic pentasaccharide PI. 
(B) *H 500 MHz spectrum of synthetic pentasaccharide P2. Characteristic proton 
5 chemical shifts of the constituent monosaccharides are marked accordingly. 

Figure 3 illustrates the ^H 500 MHz spectra of HIO decasaccharide (A). The line 
broadening is due to the presence of paramagnetic impurity; addition of deuterated 
EDTA provides a better resolved spectrum (B). The anomeric region expansion with the 
signals assignment is shown in (C). 

10 

DETAILED DESCRIPTION 
Heparin and heparan sulfate glycosaminoglycans (HSGAGs) are ceU surface 
polysaccharides that bind to a multitude of signaling molecules, enzymes, pathogens and 
modulate critical biological processes ranging from cell growth and development to 

15 anticoagulation and viral invasion. Heparin has been widely used as anticoagulant in a 
variety of clinical appUcations for several decades. The heterogeneity and complexity of 
HSGAGs pose significant challenges to their purification and characterization of 
structure-function relationship. 

Several analytical techniques includiug gel electrophoresis, chromatography and 

20 mass spectrometry have been successfiilly appUed to sequence small amounts of 
HSGAG ohgosaccharides. The prerequisite for the success of most of these techniques is 
predictable and controlled depolymerization of HSGAG oligosaccharide into smaller 
fragments using a combination of enzymatic and chemical degradation methods. 
Importantly, for some of the sequencing strategies, the use of multiple exo-enzymes are 

25 required to accurately determine the different modifications of the disaccharide units. 

A methodology was developed that utilizes experimental methods to characterize 
oligosaccharides. The invention, relates in some aspects to a method for analyzing an 
oligosaccharide, such as an HSGAG. These methods, in some embodiments, encompass 
deteimioing the presence or identity of an oligosaccharide m a sample througji the 

30 analysis of the oligosaccharide. In other embodiments, methods for assessing the purity 
of an ohgosaccharide in a sample are provided. The term "analyzing" as used hereru 
refers to the identification of one or more properties of an ohgosaccharide. In some 
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instances the analysis may be extensive and provide considerable information about the 
composition of an oligosaccharide. The term analyzing can encompass sequencing an 
oligosaccharide or determining the composition of an oligosaccharide. The term 
"determine the composition" refers to the identification of enough properties of an 
5 oligosaccharide that the oligosaccharide can be distinguished from other 
oligosaccharides. When the information regarding the properties is compiled using the 
numerical PEN firamework derived by the instant inventors the sequence of an 
ohgosaccharide, including order of building blocks and linkage information, can be 
compiled. 

10 Tlie analysis of the oUgosaccharide may involve identification of monomeric 

properties and linkage information. The importance of identifying the types of linkages 
between monosaccharides and disaccharides of oligosaccharides can be illustrated using 
HSGAGs as an example. The repeat unit of a HSGAG polymer is a disaccharide 
comprising a uronic acid (U), which can exist in two different epimeric forms - a-L- 

15 iduronic (I) of P-D-glucuronic (G), linked l->4 to a a-glucosamine residue (A). There 
are variations within the disaccharide unit in the form of sulfation or acetylation of the 
N-position of the glucosamine. Characteristic proton and ^^C chemical shifts have been 
identified for commonly occiirring monosaccharides and heparin and the relative 
abxmdance of these monosaccharides can be quantitatively determined by integrating the 

20 proton signals. In addition to characterization of the monosaccharides, the anomeric 
proton signals of the glucosaraines can be resolved fiirther to identify linkage 
information regarding the neighboring uronic acid (A-U linkage) with a defined epimeric 
and sulfation state. 

The composition of an oligosaccharide can be determined by the methods 
25 described herein using two or more experimental techniques to measure properties of the 
oligosaccharide. One of the experimental techniques is nuclear magnetic resonance 
(NMR). By utilizing NMR the number of experimental constraints required to sequence 
HSGAG oligosaccharides can be reduced. The NMR can be utilized with at least one 
other type of experimental technique. It has been demonstrated herein that the 
30 integration of NMR and capillary electrophoresis (CE) data sets with the help of the PEN 
fi-amework can reduce the need for NOESY/ROESY experiments which may require a 
lot more sample and more sophisticated instrum^ts for reasonable sensitivity. 
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Thus, methods are provided to analyze the oligosaccharides to determine the 
sequence of the ohgosaccharide. Using a numerical PEN framework, the information 
derived from the compositional analysis, e.g., two distinct linkage data sets, can be 
compiled to construct the sequence in a rapid, systematic and unbiased fashion. The 
5 numerical nature of the PEN framework facilitates moving between monosaccharide, 
disaccharide U-A and A-U linkage information using simple mathematical operations 
thus facilitating a systematic and unbiased way of rapidly arriving at a HSGAG 
sequence. 

The examples discussed below demonstrate the use of the methodology and the 
10 computational approach described herein. Examples 1 and 2 describe pentasaccharides 
containing both I2S and G. These examples demonstrate the value of determining the 
signature of the reducing end (methylation) for application of the analytical methods 
described herein. Once all of the A-U and U-A Imkages are determined, the knowledge 
of the reducing or non-reducing end enables the determination ;of the sequence by 
15 moving forward or backward. Decasaccharide HIO is one of the most complex 
decasaccharides that has been characterized and verified using a combination of 
analytical tools. 

Example 3 highlights the strengths of the methods described herein in 
comparison with other analytical tools that have been used for characterization of HIO in 

20 the past. EarUer sequencing approaches for this decasaccharide required numerous steps. 
Using the methods described herein we arrived at the sequence of HIO in an unbiased 
fashion by obtaining two distinct sets of A-U and U-A linkage information that were 
quantitatively determined using a minimal set of experimental data. This example 
illustrates the flexibility of the computational method to construct a Ust of all possible 

25 combinations of sequences satisfying the linkage and monosaccharide composition 
obtained from NMR data and elimiaation of sequences that did not satisfy the CE data. 

Thus, the hst of possible sequences can be constructed based on the identified 
properties, such as the charge, the nature and number of units of the oligosaccharide, the 
nature and number of chemical substituents on the units, disaccharide linkage, reducing 

30 and non-reducing ends, and the stereospecificity of the ohgosaccharide, which reveal 
information about the building blocks of the ohgosaccharide determined from the 
measurements of an experimental method. 



wo* 2004/055491 ^ -V'Simn^ k - . ^'^'H'. .* ^ PCT/fJS2a03/0 15850 



-8- 

The stnictuial properties of oligosaccharides may provide useful information 
about the function of the oligosaccharide. For instance, the properties of the 
oligosaccharide may reveal the entire sequence of units of the oligosaccharide, which is 
usefiil for identifying the oligosaccharide. Similarly, if the sequence of the 

5 oligosaccharide was previously unknown, the structural properties of the oligosaccharide 
are useful for comparing the oligosaccharide to known oligosaccharides having known 
fiinctions. The properties of the oUgosaccharide may also reveal that a oligosaccharide 
has a net charge or has regions which are charged. This information is useful for 
identifying compounds that the oUgosaccharide may interact with or predicting which 

10 regions of a oligosaccharide may be involved in a binding interaction or have a specific 
function. 

The invention is useful for identifying properties of oligosaccharides. A 
'"property" as used herein is a characteristic (e.g., stmctural characteristic) of the 
oUgosaccharide that provides information (e.g., structural information) about the 

1 5 oligosaccharide. A compilation of several properties of a oUgosaccharide may provide 
sufficient information to identify a chemical unit or even the entire oUgosaccharide but 
the property of the oUgosaccharide itself does not encompass the chemical basis of the 
chemical unit or oUgosaccharide. Due to the complexity of the oUgosaccharide, a 
property may identify a type of monomeric building block of the oligosaccharide. The 

20 units of the oligosaccharides have more variables in addition to its basic chemical 

structure. For example, the oUgosaccharide may be acetylated or sulfated at several sites 
on the building block, or it may be charged or xmcharged. Thus, one property of an 
oUgosaccharide may be the identity of one or more basic building blocks of the 
oligosaccharides. 

25 A basic building block alone, however, may not provide information about the 

charge and the nature of substituents of the saccharide or disaccharide. For example, a 
building block of uronic acid maybe iduronic or glucuronic acid. Each of these building 
blocks may have additional substituents that add complexity to the structure of the 
building block. A single property, however, may not identify such additional substitutes 

30 charges, etc., in addition to identifydng a complete building block of a oUgosaccharide. 
This information, however, may be assembled from several properties. Thus, a property 
of an oUgosaccharide as used herein encompasses a monosaccharide or disaccharide 
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building block of an oligosaccharide. The NMR methods described herein are useful for 
identifying information about basic monosaccharide building blocks. 

The type of property that will provide structural information about the 
oUgosaccharide is a property such as charge, molecular weight, nature and degree of 

5 sulfation or acetylation, or type of saccharide. Properties include but are not limited to 
charge, chirahty, nature of substituents, quantity of substituents, molecular weight, 
molecular length, compositional ratios of substituents or units, type of basic building 
block of a oUgosaccharide, hydrophobicity, enzymatic sensitivity, hydrophilicity, 
secondary structure and conformation (i.e., position of heUces), spatial distribution of 

10 substituents, ratio of one set of modifications to another set of modifications (i.e., relative 
amoimts of 2-0 sulfation to N-sulfation or ratio of iduronic acid to glucuronic acid), 
binding sites for proteins, and linkage information. Other properties will easily be 
identified by those of ordinary skill in the art, A substituent, as used herein is an atom or 
group of atoms that substitute a unit, but are not themselves the units. 

15 It has been discovered that the use of NMR can significantly reduce the number 

of experimental constraints required to derive enough property information to identify a 
complete sequence of an oUgosaccharide. By providing information regarding specific 
linkages and monosaccharide compositions NMR dramatically improves sequencing 
techniques. 

20 Thus, one embodiment of the methods utilizes the strengths of NMR to measure 

A-U Hnkages and an orthogonal set of U-A Unkage information obtained from CE' to 
construct the sequence of an oUgosaccharide. NMR is a powerful tool that can be used 
to determiae numerous parameters defining the sequence of an intact oUgosaccharide 
including monosaccharide composition, sulfation pattern and linkage between 

25 glucosamiae and uronic acid (A-U). These parameters can be readily determined 
independent of sequence length and variabiUty of building blocks, using a single series 
of simple ID proton and 2D COSY/TOCSY experiments. Thus, by combining the 
distinct linkage information between adjacent monosaccharides obtained from NMR (A- 
U linkage information) with the U-A linkage information obtained from a single 

30 capillary electrophoresis experiment to r25>idly arrive at the sequence of HSGAG 
oUgosaccharides. 
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Merpretation of NMR spectra of HSGAGs, in the past, has had certain 
limitations due to overlaps in proton signals and absence of measurable coupling 
constants. Also the sensitivity of this technique is lower than those based on detection of 
chromatographic effluents (Tumbull et al, 1999; Vives et aU 1999; Kreuger et al, 2001) 

5 and on mass spectrometry (Pope et al, 2001 ; Venkataraman et al , 1 999). Thus, it was 
very difficult to characterize samples that are only available in small quantities. It has 
been discovered that despite these limitations NMR can provide pov/erfiil analytical 
information useful in sequencing. 

NMR spectroscopy is an analytical tool that allows for the determination of 

10 molecular structure. Utilizing the magnetic properties of some nuclei, the nuclear spins 
of the nuclei can be oriented randomly v^^ith an external magnetic field. Oriented nuclei 
that are subsequently irradiated at the correct fi*equency will absorb energy and transition 
to a higher energy state. Upon relaxation this energy is emitted and detected in various 
NMR systems. This irradiation of the nuclei occur in pulses. In basic one dimensional 

15 (ID) NMR the excitation is produced from a single pulse and emitted radiation is 

detected as free induction decay OFID). In two dimensional (2D) NMR spectroscopy the 
nuclei is irradiated with two pulses, and acquisition of the FID occurs at many time 
points with a delay between the pulses. 

There are many types of 2D spectroscopy which include: COSY, TOSCY, 

20 NOESY and ROES Y. COSY (Correlated Spectroscopy) is helpftd in determining the 
energy that is arising from neighboring protons. This is helpftd if there is overlap or 
second order coupling. Spui-spin coupling in COSY allows the spectrum to yield 
through bond interactions. Another 2D NMR technique enable of measuring through 
bond interactions is TOSCY (Total Correlated Spectroscopy) which identifies the 

25 protons that produce signals within a spin system. COSY and TOSCY can be combined 
for a more powerful stmctural analysis. 

Other 2D NMR techniques allows the measurement of through space interactions. 
These methods are referred to as NOESY (Nuclear Overhauser Effect Spectrsocopy) and 
ROESY (Rotational Nuclear Overhauser Effect Spectroscopy). NOESY identifies the 

30 signals emitted from protons that are close in space by not directly connected by 

bonding. The NOESY spectra give through space correlations as the change in intensity 
of multiplets from neighboring nuclei upon irradiation can be extensively measured. In 
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instances where NOESY signals are weak, ROESY can be used, a similar technique that 
has cross peaks that are only negative. Other techniques may be used and would be 
apparent to those of ordinary skill in the art. 

The NMR spectroscopy useful in the methods described herein may be ID proton 
5 or 2D COSY/TOCSY in some embodiments. ID proton NMR and 2D COSY/TOCSY 
NMR spectra provide quantitative information on multiple parameters including 
monosaccharide composition, sulfation states and A-U linkage information that dejSne 
the sequence of an oligosaccharide. Furthermore, NMR provides an accurate method for 
direct quantification of the iduronic and glucuronic acid content in a sequence. 
10 In the method of capillary gel-electrophoresis, reaction samples may be analyzed 

by small-diameter, gel-filled capillaries. The small diameter of the capillaries (50 |im) 
allows for efficient dissipation of heat generated during electrophoresis. Thus, high field 
strengths can be used without excessive Joule heating (400 V/m), lowering the separation 
time to about 20 minutes per reaction run, therefor increasing resolution over 
15 conventional gel electrophoresis. Additionally, many capillaries may be analyzed in 
parallel, allowing amplification of generated oligosaccharide information. 

Currently, saccharide jQragments are detected in capillary electrophoresis by 
monitoring at 232 nm, the wavelength at which the A"**^ double bond, generated upon 
heparinase cleavage, absorbs. However, other detection methods are possible. First, 
20 nitrous acid cleavage of heparin fragments, followed by reduction with ^-sodium 

borohydride yields degraded fi-agments having a radioactive tag. This represents both 
a tag which may be followed by capillary electrophoresis (counting radioactivity) or 
mass spectrometry (by the increase m mass). Another method of using radioactivity 
would be to label the heparin firagment with S^^. Similar to the types of detection 
25 possible for ^H-labeled Augments, S^^ labeled fragments may be usefiil for radioactive 
detection (CE) or measurement of mass differences (MS). 

Especially ra the case of S^^, this detection will be powerful. Li this case, the 
human sulfotransferases may be used to label specifically a certain residue. This will 
give additional structural information. 
30 Nitrous acid degraded fragments, unlike heparinase-dOTved firagments, do not 

have a UV-absorbing chromophore. For CE, two methods may be used to monitor 
fragments that lack a suitable chromophore. First is indirect detection of fragments. We 
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may detect heparin jBragments with our CE methodology using a suitable background 
absorber, eg., 1,5-napthaIenedisulfonic acid. The second method for detection involves 
chelation of metal ions by saccharides. The saccharide-metal complexes may be 
detected using UV-Vis just like monitoring the unsaturated double bond. 
5 For determining the reducing or non-reducing ends a variety of experimental 

methods may be used including chemical degradation, end-labeling and capillary 
electrophoresis. In some embodiments, the ends may be determined by measuring a 
signature of a reducing or non-reducing end. For some oligosaccharides, the signature of 
the reducing end is methylation, and this signature may also be determined by capillary 

1 0 electrophoresis as well as other methods known in the art. 

The reducing end of an oUgosaccharide may be distinguished from the non- 
reducing end using mass tags, for instance. All of these tags involve selective chemistry 
with the anomeric OH (present at the reducing end of the oHgosaccharide), thus labeling 
occurs at the reducing end of the chain. One conamon tag is 2-aminobenzoic acid which 

15 is fluorescent. In general tags involve chemistry of the following types: (1) reaction of 
amines with the anomeric position to form imines 2-aniinobenzoic acid), hydrazine 
reaction to form hydrazones, and reaction of sencucarbazones with the anomeric OH to 
form semicarbazides. Commonly used tags (other than 2-aininobenzoic acid) include the 
following compounds: 

20 1. semicarbazide 

2 . Girard' s P reagent 

3. Girard*s T reagent 

4. p-aminobenzoic ethyl ester 

5. biotin-x-hydrazide 
25 6. 2-amuiobenzamide 

7. 2-aminopyridine 
8. anthranihc acid 

9. 5-[(4,6-dichlorotriazine-2-yl)arnind]-fluorescein 
10. 8-aininonaphthalene-l,3,6-trisulfonicacid 
30 11. 2-aniiQoacridone 

The properties, in some aspects of the invention, may be determined for the 
oligosaccharide in its intact or a fragmented form. Fragments of ohgosaccharides in 
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some embodiments of the invention can be produced by enzymatic digestion. la some 
embodiments the digestion is complete, or it may be a partial digestion. 

Oligosaccharide fragments may be degraded using enzymes such as heparin lyase 
enzymes or nitrous acid and they may also be modified using different enzymes that 
5 transfer sulfate groups to the positions mentioned earlier or remove the sulfate groups 
from those positions. The modifying enzymes are exolytic and non-processive which 
means that they just act once on the non-reducing end and will let go of the heparin chain 
without sequentially modifying the rest of the chain. For each of the modifiable 
positions in the disaccharide unit there exits a modifying enzyme. An enzyme that adds 
10 a sulfate group is called a sulfotransferase, and an enzyme that removes a sulfate group is 
called a sulfatase. The modifying enzymes include 2-0 sulfatase/ sulfotransferase, 3-0 
sulfatase/sulfotransferase, 6-0 sulfatase/sulfotransferase and N-deacetylase-N- 
sxilfotransferase. The. function of these enzymes is evident from their names, for 
example a 2-0 sulfotransferase transfers a sulfate group to the 2-0 position of an 
15 iduronic acid (2-0 sulfated glucuronic acid is a rare occmrence in the HSGAG chains) 
and a 2-0 sulfatase removes the siilfate group from the 2-0 position of an iduronic acid. 

HSGAG degrading enzymes include heparinase-I, heparinase- n , heparinase-IQ, 
D-glucuronidase and L-iduronidase. The heparinases cleave at the glycosidic linkage 
before a uronic acid. Heparinase I clips at a glycosidic linkage before a 2 -O sulfated 
20 iduronic acid. Heparinase -HI cleaves at a glycosidic linkage before an unsulfated 

glucuronic acid. Heparinase -11 cleaves at both Hep-I and Hep-Hi cleavable sites. After 
cleavage by the heparinases the uronic acid before which the cleavage occurs loses the 
information of iduronic vs. glucuronic acid because a double bond is created between the 
C4 and C5 atoms of the uronic acid. 
25 Glucuronidase and iduronidase, as their name suggests cleave at the glycosidic 

linkage after a glucuronic acid and iduronic acid respectively. Nitrous acid cUps 
randomly at glycosidic linkages after a N-sulfated hexosamine and converts the six 
membered hexosamine ring to a 5 membered anhydromannitol ring. 

As used harein, the term "oligosaccharide" is used interchangeably with the term 
30 "polysaccharide". An "oUgosaccharide" is a biopolymer comprised of linked saccharide 
or sugar units. As xised herein with respect to linked units of a oligosaccharide, "linked" 
or "linkage" means two entities are bound to one another by any physicochemical means. 
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Any linkage known to those of ordinary skiU in the art, covalent or non-covalent, is 
embraced. Such linkages are well known to those of ordinary skill in the art. Natural 
linkages, which are those ordinarily found in nature connecting the chemical units of a 
particular oligosaccharide, are most common. Natural linkages include, for instance, 

5 amide, ester and thioester linkages. The mits of an oligosaccharide analyzed by the 
methods of the invention may be linked, however, by synthetic or modified linkages. 
OUgosaccharides where the units are linked by covalent bonds will be most conomon but 
also include hydrogen bonded, etc. 

The oligosaccharide is made up of a plurality of chemical units. A "chemical 

1 0 unit" as used herein is a building block or monomer which may be linked directly or 
indirectly to other building blocks or monomers to form an oligosaccharide. The 
oligosaccharide preferably is a oligosaccharide of at least two different linked units. An 
oUgosaccharide is a biopolymer composed of monosaccharides linked to one another. In 
many ohgosaccharides the basic building block of the oligosaccharide is actually a 

1 5 disaccharide unit which may be repeating or non-repeating. Thus, a unit when used with 
respect to a oUgosaccharide refers to a basic building block of an oUgosaccharide and 
may include a monomeric building block (monosaccharide) or a dimeric building block 
(disaccharide). 

A "pluraUty of chemical units" is at least two imits linked to one another. 
20 The oligosaccharides may be native or naturally-occurring oUgosaccharides which occur 
in nature or non-naturally occurring oUgosaccharides which do not exist in nature. The 
oUgosaccharides typically include at least a portion of a naturally occurring 
oUgosaccharide. The oUgosaccharides may be isolated or synthesized de novo. For 
example, the oUgosaccharides may be isolated from natural sources e.g. purified, as by 
25 cleavage and gel separation or may be synthesized e.g., by chemical synthesis. 

A data structure for representing the properties of the oUgosaccharide is also 
provided- In some embodiments the data structure represents the properties as non- 
character values. These values in some embodiments can be a binary value. In some 
embodiments the building blacks are the types of monosaccharides and disaccharide 
30 linkages of the oUgosaccharide. 

The rapid sequencing methodology for polysaccharides using chemical and 
enzymatic tools followed by numerical analysis techniques is described in detail in 
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Venkataraman, G., et al.. Science, 286, 537-542 (1999), and U.S. Patent Applications 
Serial Nos. 09/557,997 and 09/558,137, both filed on April 24, 2000 and having 
common inventorship, all of which are specifically incorporated by reference. 

5 EXAMPLES 
Materials and methods 

The synthetic pentasaccharides PI and P2 corresponding to the active sequence 
of heparin for AT-in binding were a gift from M. Petitou, Sanofi-Synthelabo, Toxalouse, 
France. The decasaccharide HIO, kindly provided by RJ. Linhardt (University of Iowa), 
10 was obtained by fractionation of heparinase digest of pig mucosal heparin, on an AT-in 
column as described earlier (Toida et aL, 1996) 

NMR Spectroscopy: 

The oligosaccharide samples were prepared by dissolving 2 mg of the 
15 pentasaccharide and 150 ^g of HIO in 0.5 ml of D2O 99.99%. Due to signal broadening 
caused by paramagnetic ions in HIO, deuterated EDTA was added to the sample to 
remove these ions. (Neville et al, 1989) The ^H-NMR spectra were recorded at 500 
MHz on a Bruker AMX 500 spectrometer at 60 °C with presaturation of the residual 
water signals and with recycle delay of 12 seconds; a 45^ pulse was used. 2D DQF- 
20 COSY (Double Quantum Filtered-COSY) and TOCSY were measured in phase-sensitive 
mode using TPPI (Time Proportional Phase Incrementation), and a shifted square sine- 
bell fimction was appUed before Fourier transformation. 32 and 512 scans for each FID 
were used for the pentasaccharide and the decasaccharide, respectively. 

25 Compositional Analysis using capillary electrophoresis ( CE): 

Compositional analysis of the oUgosaccharides was completed by exhaustive 
enzymatic digest of a 30 nM sanq)le followed by capillary electrophoresis (CE) as 
described earlier (Venkataraman et al, 1999). Briefly, to 1 nmol of oUgosaccharide was 
added 200 nM of heparinases I, H, and III in 25 mM sodium acetate, 100 mM NaCl, 5 

30 mM calcium acetate buffer, pH 7.0. The reaction was allowed to proceed at 30°C 
overnight and then analyzed by CE in reverse polarity with a running buffer of 50 mM 
Tris/phosphate/10 \iM dextran sulfate, pH 2.5. 
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Incorporating CE and NMR data as constraints using PEN framework: 

The PEN is a nmnerical notation scheme that encodes the sulfation pattern of a 
disaccharide building block as a series of binary on/off states and epimerization of the 
uronic acid as a + or - sign bit leading to a signed hexadecimal coding scheme 
5 (Venkataraman et al, 1999). Although, the PEN frameworik was originally developed to 
encode a U-A disaccharide building block (Di), it was mathematically decomposed into a 
base4 code for representing the uronic acid monosaccharide (U) and a baseg code for 
representing the glucosamine monosaccharide (A) (Figure 1). Note that for the AU-A 
linkages the * used in the G/I position indicates that the epimeric state of the uronic acid 

10 is undetermined. It is also important to note that the signed hexadecimal codes 
representing the A-U linkages involves rearrangement of the 3 binary digits encoding A 
and two binary digits encoding U from the original PEN framework. As a resxilt of the 
rearrangement, the + and - sign is used to represent 6-0 sulfation (where + represents 
unsulfated and - represents sulfated) of the glucosamine instead of the epimeric state of 

15 the uronic acid since the 6-0 sulfation is in the left most position of the A-U disaccharide 
code. Therefore there is no "extra" binary digit that has been added for representing the 
A-U disaccharide and we still use the signed baseje hexadecimal code. Further the PEN 
framework was also used to encode an A-U disaccharide unit (Di') by transposiag the 3 
bits that encode for the sulfation state of the glucosamine with the 2 bits that encode the 

20 epimeric and sulfation state of the uronic acid (Table 1). The information obtained from 
NMR and CE data is shown in A. Columns 1-3 indicate the number of linkages between 
glucosamine residues (colored in gray) the uronic acids in column 4 obtained from NMR 
data. Columns 5-7 indicate the linkages between uronic acid and the glucosamines 
obtained from CE data. The sequences that satisfy the monosaccharide composition and 

25 A-U linkage information (Di') are shown in B. Application of the U-A linkages from CE 
data reduces Lnmr to the final conect sequence. 

Table 1: Seqnence assignment of HIO 
A 



Linkage information from CE and NMR 
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10 



B 



A-U linkage from NMR 




U-A linkage jfrom CE 


A* 




AnS,6X 


AU2S 


AnS.6S 


AnAc,6S 


A* 


3 










2 


hs 












1 


I 




1 






1 




G 






1 



Seauences satisfvine NMR data rLisjiuni') 




Di' linkage 




Sequence 




arrangement 








-5 -5 


-2 -4 


A* 


+DDC-57 


AU 


-5 -5 


-5 -2 


A* 


±DDD4-7 




-S -2 


-5 -4 


A* 


±DC-5D7 




-5 -4 


D -2 


A* 


±DD5C-7 


AU 


-5 -2 


-4 -5 


A* 


+DC-5DP 


AU 


-5 -4 


-2 -5 


A* 


±DD4-5F 


AU 


-2 -5 


-5 -4 


A* 


±C-5DD7 




-4 -5 


-5 -2 


A* 


±D5DC-7 




-2 -5 


-4 -5 


A* 


+C-5DSF 




-4 -5 


-2 -5 


A* 


±D5C-5F. 




-2 -4 


-5 -5 


A* 


±C-55DF 



Apply CE data 
as constraints 



Sequences satisfying 
CE and NMR Data 



1-D proton NMR spectrum along with the 2D COSY, HSQC (Heteronuclear 
15 Single Quantum Coherence) and TOCSY spectra provide data on the chemical shifts and 
coupling constants of most the ring protons of the constituent monosaccharides. This 
data was used to uniquely identify the monosaccharides (Ui and Ai) and obtain the 
number of monosaccharides for a given length of the sequence. In addition to the identity 
of the glucosamine monosaccharides, their characteristic anomeric chemical shifts were 
20 further resolved to identify their linkages to adjacent uronic acids (Ai-Ui linkages 
defining Di' disaccharides). The Ui, Ai and Di' information was used to build a list of all 
the possible sequences satisfying this data (Lnmr). The sequences in Lnmr represent a 
comprehensive sample space without any bias towards commonly occmring sequences. 
Disaccharide compositional anal>^is using CE provides accurate information on 
25 the sulfation pattern of a AU-A disaccharide, thus identifying all the U-A linkages (iDi) 
whose sign bit is not known due to the A"^^ unsaturated bond. Incorporating the 
disaccharide linkages dbDi obtained fix)m CE data eliminates most of the sequences fixim 
Lnmr converging on a single sequence. 
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Results 

Example 1: Pentasaccharide PI 

Se-veral characteristic chemical shifts of the monosaccharide anomeric protons 
were observed in the ID proton NMR spectrum of the pentasaccharide (Figure 2A). The 
5 ID proton signals along with the 2D COSY and TOCSY spectra were used to assign the 
monosaccharides. The signal patterns at 5.648, 5.438 and 5.041 ppm were assigned to 
the anomeric protons of N-sulfated glucosamines (Ans,6x). Further, the 6-0 sulfation of 
all these glucosamines were coafinned hy TOCSY. The signal at 3.43/57.5 ppm 
indicates the presence of an O-methyl group linked to the reducing terminal. In addition, 
10 the presence of the methyl group at the reducing end also accounts for the absence of the 
typical reducing end carbon chemical shift (92-93 ppm). The chemical shifts at 5.251 and 
4.635 are in agreement with an I^s and G respectively. The anomeric proton signals at 
5.648 and 5.438 are distinguished further as arising from Ans,6s-I2s and Ans,6s-G 
respectively. 

15 Integration of these peaks (Guerrini et al., 2001) gave the relative molar 

abundance of the glucosamines as A^s,6S-I2s:Ans,6S-G:Aks,6S ~ 1:1^1 and that of the 
uronic acids as l2s:G =1:1. Thus from the ID and 2D NMR data the identity A, = [58; 58; 
58 (Ans,6s)]> Ui = [I4 (l2s); 24 (G)] and relative abundance (m5=3, nl=n2=l) of the 
monosaccharides constituting the sequence were determined. Further the linkages Di' = 

20 [-5i6 (Ans,6S"I2s); -616 (Ans.6s-G)] were also obtained form IsIMR data. Based on this 
information from the NMR data there can be two possible (Ljimr) pentasaccharide 
sequences: 5^ I4 Sg 24 5% (Ans,6s - I2S - Ans,6S - G - ANs,6S,0Me) and 58 24 Sg I4 58 (Ans.6s 

- G - Ans,6S — I2S ~ AKS,6S.OMe)- 

Capillary electrophoresis of the fragments formed by complete digestion of the 
25 pentasaccharide with the heparinases resulted in two peaks corresponding to a trisulfated 
AU2sAns,6S and a disulfated disaccharide AUAns.6s thus defining ±Di = [iDi^; ±5i6]. The 
relative molar abundance of these two disaccharides was calculated as 1 :1 by integration 
of the CE signals and normalizing the peak areas using an internal cahbration. The 
migration time of the ±Di6 disaccharide was slightly different from the standard 
30 indicating that the methylated glucosamine is a part of the trisulfated ±Di6 disaccharide. 
Thus the data from CE fixes the sulfation state of methylated reducing end disaccharide. 
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Incorporating the constraints from CE data eliminated one of the sequences from Lnmr 
thus converging on Sg 24 Sg U Sg (Ans,6s - G - Ans,6s - las - ANs,6s,0Me). 

Example 2: Pentasaccharide (P2). 
5 From the ID proton spectrum (Figure 2B) the signal pattern (anomeric peaks at 

5.64/99.6 ppm, 5.50/98.1 ppm) is consistent with N-sulfated, 6-0-sulfated glucosamines 
(Ans,6s), and A* bearing an extra 3-0 sulfate group. Similar to Example 1, the signal 
pattern at 3.43/57,5 ppm corresponds to a glucosamine with a methylated reducing end. 
Also the anomeric chemical shift at 5.64 ppm arises from a Ans,6s linked to G as shown 
10 in Example 1. Signals at 5.20/101.7 ppm and 4.78/72.4 ppm agree with HI and H5 of ts 
residue, and the anomeric signal at 4.6/103.3 ppm with G (MuUoy and Johnson, 1987; 
Yates et al, 1996). Thus Ui = [I4 Ozs); 24 (G)] and Ai = [Sg; 58 (Ans,6s); 1% (A*)]. In this 
case we have only one of the two elements of Di' defined = [-616 (Ans,6S-G)]. 
Incorporating the inferences from NMR data as constraints we get Lnmr == 2 sequences: 

15 58 24 78 I4 58 (ANS,6S-G-A*-l2S-ANS.6S.0Me) and 78 I4 58 24 58 (A*-l2S-ANS.6S-G-ANS,6S,0Me) 

The disaccharide composition analysis using CE resulted in a single peak 
corresponding to a trisulfated disaccharide with a shifted migration time indicating the 
presence of the methylated glucosamine (±Di = [±Di6]). Using the data from CE one of 
the sequences from Lnmr was eliminated to give the right sequence 58 24 78 14 Sg (Ans.6s- 
20 G-A*-l2S-ANs,6s.0Me). This sequeuce is consistent with the notion that the Ans,6s-G 
linkage is resistant to cleavage by heparinase I, n and IH due to the presence of the 3-0 
sulfated A* (Shriver, Z et al 2000) thus resulting only in a single disaccharide observed 
using CE. 

25 Example 3: Decasaccharide (HIO): 

While the first two synthetic pentasaccharide examples clearly outline the 
methodology of our approach, the method is better illustrated by the longer and more 
complex heparin derived oligosaccharide (HIO). This sequence is presently among the 
most complex heparin derived ohgosaccharide sequenced to date. It is important to point 

30 out that much eflfort has gone into the isolation and sequencing of HIO (Toida et aLy 
1996). Due to its conq)lexity, there were inaccuracies in its structure determination in the 
past and only recently using a combination of analytical tools, this sequence was 
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established (Venkataraman et al,\999\ Shriver et al, 2000). NMR spectroscopy has 
been used in the past to corroborate its sequence (Shriver et al, 2000) but only the 
monosaccharide composition was established and there was bias in the interpretation of 
the NMR data based on the determined sequence. Using these examples we highlight the 
5 flexibility of our approach in providing an unbiased assignment of complex heparin 
derived oligosaccharide structures. 

The signal line broadening of the proton spectrum of HI 0 (Figure 3 A) is caused 
by the complexation of paramagnetic ions with the negatively charged groups. Addition 
of EDTA provides a better resolved spectrum (Figure 3B) by removal of these 
10 paramagnetic ions. (Neville, et al., 1989) 

The assignment of the anomeric signals (Figure 3C) and their respective proton 
patterns (Table 2), were carried out by COSY and TOCSY experiments. 

Table 2: chemical shifts of the constituent monosaccharides of the HIO sample. 

15 Chemical shifts are given in ppm downfield from trimethylsilyl propionate (TSP) as 



standard. 





AU 






Ans 


I 


An Ac 


G 


A* 


HI 


5.521 


5.42 


5.20 


5.369 


5.019 


5.386 


4.668 


5.460 


H2 


4.635 


3.31 


4.34 


3.275 


3,767 


3.940 


3.415 


3.483 


H3 


4.314 


3.65 


4.22 


3.65 


4.126 


3.77 


3.71 


4.571 


H4 


6.001 


3.71(3,S2Jf 


4.13 


3.77 


4.102 


3.75 


3.804 


4.040 


H5 




4.01 


4.87 


4.01 


4.829 


4.01 


3.71 


4.261 


H6 




4.40 




4.40 




4.35 




nd 



^ - two monosaccharide residues 



^ - H4 chemical shift of the Ans,6S residue following the non reducing AU unit. 

20 The signals detected between 4.2-4.4 ppm are in agreement with H-6 proton from 

6-0 sulfated glucosamine. However since this chemical shift hes in the crowded area of 
the spectrum and due to the presence of minor impurities in the sample it was not 
possible to accurately determine the molar abundance of glucosamines containing the 6^ 
O sulfate groups. However disaccharide compositional analysis of HIO using CE 

25 indicated the presence of 3 major disaccharide components - AUis-Ans. 6s, AU-Anac, es, 
AU-A* in the ratio 3:1:1, respectively, giving iDi = [±D; ±4; ±7]. Thus the data from CE 
fixed the 6-0 sulfation of all the glucosamines. 



K'O I _ . PCTAJS2003/0t5850 



-21- 

The relative abxmdance of the glucosamine monosaccharides calculated by signal 
integration were 5% (Ans.6s): 1% (A*): 48 (Anac.6s) = 3:1:1, thus Ai = [5$; Ss; 5%\ 1%\ 
The two a anomeric signals at 5.20 and 5.019 ppm arise from 2-0 sulfated and non 
sulfated iduronic acid, respectively, as demonstrated by the chemical shift pattern. The 

5 only (3 proton signal of the spectrum (at 4.669 ppm) belongs to a glucuronic acid residue. 
Protons at 6 ppm and 5.521 ppm belong to the H4 and HI of the AU residue. The H2 at 
4.635 ppm indicates that the xmsaturated uronic acid residue is 2-0-sulfated. The relative 
abundance of the uronic acid monosaccharides was calculated as fcs: I: G: AU2S were 
identified in the ratio 2:1:1:1 respectively thereby defining Ui = [I4; U; O4; (*1)4] 

10 (where * stands for AU since this bit is not defined). 

The chemical shift of the signal at 5.369 ppm agrees with a Ans,6s hnked to I. The 
anomeric chemical shift of a Anac, 6S is distinct for Anac,6S-I (5.14-5.18 ppm) and 
Anac,6S-G (5.30-5.36 ppm) Imkages (Cohen, 1999; Chuang, et al, 2001). The anomeric 
proton of Anac,6s at 5.386 ppm confirms the presence of Anac.6X-G linkage in the 

15 sequence. The chemical shift at 5.42 agrees with both Ans,6S linked to fcs and Ans,6S at 
the reducing end. Since the AU residue is linked to an Ans.6s unit and a second Ans.6S is 
linked to I, two possibilities are left for the reducing end, one with Ans,6s and the other 
with A*. However, the chemical shift pattern associated with A* (HI, 5.464ppm; H2, 
3.480ppm; H3, 4.564ppm, H4, 4.041ppm) is the same as found by Yamada et al (1993) 

20 for a heparin tetrasaccharide with this residue at the reducing end (chemical shifts in the 
Yamada et al paper are systematically shifted about -0.03 ppm with respect to our 
values). Thus the signal pattern of A* is consistent with its location at the reducing end. 
Based on the relative abundance of these signals all the elements of Di' were defined as 
[-5i6; -5i6; -4i6; -2i6]. Translating the Ai, Ui and Di* to constraints using the PEN 

25 framework Lnmr=12 sequences were obtained (Table 1). 

Eliminating the sequences from L>fmr that do not contain disaccharide linkages 
-coiresponding4o ±Di resulted in a single sequence DDD4-7 which isxonsistent-with the. 
HIO sequence obtained earUer (Shriver, et al, 2001).. 

30 The foregoing written specification is considered to be sufficient to enable one 

skilled in the art to practice the invention. The present invention is not to be limited in 
scope by examples provided, since the examples are intended as a single illustration of 
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one aspect of the invention and other functionally equivalent embodiments are within the 
scope of the invention. Various modifications of the invention in addition to those 
shown and described herein will become apparent to those skilled in the art from the 
foregoing description and fall within the scope of the appended claims. The advantages 
5 and objects of the invention are not necessarily encompassed by each embodiment of the 
invention. 

All references, patents and patent publications that are recited in this application 
are herein incorporated by reference. 
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We claim: 

CLAIMS 

1. A method of determining the composition of an oligosaccharide, comprising: 
obtaining a measurement of a first property of the ohgosaccharide using NMR 

5 spectroscopy, and obtaming a measiirement of a second property of the ohgosaccharide 
by a second experimental method, wherein the first and second properties determine the 
composition. 

2. The method of claim 1, wherein the second property of the ohgosaccharide is 
10 measured by capillary electrophoresis. 

3. A method of analyzing an ohgosaccharide, comprisiag: obtaining a 
measurement of a first type of disaccharide linkage of the oligosaccharide by a first 
experimental method, and obtaining a measm^ement of a second type of disaccharide 

15 hnkage of the oligosaccharide by a second experimental method, to analyze the 
ohgosaccharide. 

4. The method of claim 3, wherein the first type of disaccharide linkage is 
measured by NMR spectroscopy. 

20 

5. The method of claim 3, wherein the second type of disaccharide linkage is 
measured by capillary electrophoresis. 

6. A method of analyzing an ohgosaccharide, comprising: identifying a first 
25 property of the oligosaccharide by NMR spectroscopy, and identifyhag a second property 

of the ohgosaccharide by capillary electrophoresis, to analyze the ohgosaccharide. 

7. The method of claim 1 or 4, further comprising: determining possible 
sequences of the ohgosaccharide that are consistent with the measurement firom the 

30 NMR spectroscopy and second experimental method. 
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8. The method of claim 1 or 4, further comprising: constructing a list of possible 
sequences based on the measurement from the NMR spectroscopy, and eliminating 
sequences from the list of possible sequences that are not consistent with the 
measurement of the second experimental method. 

5 

9. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
distinguishes the reducing and non-reducing ends of the oligosaccharide or fragments 
thereof, 

10 10, The method of claim 9, wherein the second experimental method includes 

chemical degradation. 

11. The method of claim 9, wherein the second experimental method includes 
end-labeling. 

15 

12. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
determines the signature of the reducing end of the oUgosaccharide or fragments thereof. 

13. The method of claim 12, wherein the signature of the reducing end is 
20 determined with capillary electrophoresis. 

14. The method of claim 1, 2, 4, 6, 7 or 8, wherein the NMR spectroscopy 
includes the determination of the sulfation pattern of the oligosaccharide or fragments 
thereof. 

25 

15. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 
allows the determination of the sulfation pattern of the second type of disaccharide 
linkage. 



16. The method of any one of claims 1-8, further comprising: obtaining a 
measinre of an additional property of the oligosaccharide by a third experimental method 
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to further eliminate sequences not consistent with measurements obtained from the third 
experimental method. 

17. The method of claim 1, 2, 4, 6, 7 or 8, wherein the NMR spectroscopy is ID 
5 proton or 2D COSY/TOCSY. 

18. The method of claim 1, 2, 4, 6, 7 or 8, wherein the NMR spectroscopy is 
performed on the oligosaccharide in its intact form. 

10 19. The method of claim 1, 3, 4, 7 or 8, wherein the second experimental method 

includes digesting the oligosaccharide to a fragmented form. 

20. The method of claim 19, wherein the fragmented form is produced by 
enzymatic digestion. 

15 

21 . The method of claim 20, wherein the enzymatic digestion is complete. 

22. The method of claim 1, wherem the NMR is performed to identify and 
quantify both reducing and non-reducing ends. 

20 

23. A method of generating a list of possible sequences of an oligosaccharide, 
comprising: defining a set of properties of the oligosaccharide by performing NMR 
spectroscopy and a second experimental method, wherein the NMR spectroscopy 
provides a measurement of a first type of disaccharide linkage and the second 

25 experimental method provides a measurement of a second type of disaccharide Unkage, 
and constructing a list of possible sequences based on the set of properties of the 
oligosaccharide. 

24. The method of claim 23, wherein the NMR spectroscopy includes a measure 
30 of the monosaccharide composition of the oligosaccharide. 
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25. The method of claim 23, further comprising a data structure which represents 
the properties as non-character values. 

26. The method of claim 25, wherein the data structure includes a value for each 
5 type of monosaccharide. 

27. The method of claim 25, wherein the data structure encodes a value for each 
type of disaccharide linkage. 



28. The values of claim 25, wherein the values are binary. 
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laim Claims as Pending 
after Amendment dated 11/13/07 

1 . (Currently Amended) A method for quantifying domains within a punctate 
distribution comprising the steps of: 

acquiring at least one digital image of a punctate distribution, wherein the 
at least one digital image is a fluorescence photomicrographic image of the 
nucleus of a eukaryotic cell; 

identifying domains within the punctate distribution by image 
segmentation of the digital image, wherein the step of identifying domains 
further comprises the step of defining domains; 

computing organizational parameters; and 

performing statistical analysis of the organizational parameters. 

2. (Original) The method of claim 1, wherein the step of acquiring digital images 
further comprises the step of deconvoluting the digital image. 

3. (Cancelled) 

4. (Original) The method of claim 1 wherein the step of identifying domains further 
comprises the step of providing a mask of a portion of the digital image. 

5. (Original) The method of claim 1 wherein the step of identifying domains further 
comprises the step of producing a masked image. 

6. (Original) The method of claim 1 wherein the step of identifying domains further 
comprises the step of determining an image threshold. 

7. (Previously Presented) The method of claim 1 wherein the step of identifying 
domains further comprises the step of selecting a threshold that maximizes the 
number of detectable domains. 

8. (Cancelled) 
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9. (Original) The method of claim 1 wherein the organizational parameters include 
at least two of the number of domains, mean domain size, median domain size, 
standard deviation of domain size, variance of domain size, skevraess of domain 
size, kurtosis of domain size, coefficient of variation of domain size; index of 
dispersion of domain size; mean nearest neighbor distance, median nearest 
neighbor distance, standard deviation of nearest neighbor distance, variance of 
nearest neighbor distance, skevraess of nearest neighbor distance, kurtosis of 
nearest neighbor distance, coefficient of variation of nearest neighbor distance; 
index of dispersion of nearest neighbor distance, domain density. Re, Ro/Re, Ve, 
VoA^e, Rp, Rd, or Rd/Rp. 

1 0. (Original) The method of claim 9 further comprising the step of performing factor 
analysis on the organizational parameters. 

11. -26. (Cancelled) 
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